321 research outputs found

Rise of the Machines

Author: A Srivatsan
AM Earl
B Wold
CM Egan
D Field
D Gresham
D Hernandez
DA Wheeler
David Gresham
ES Lander
F Kunst
F Sanger
H Li
J Schacherer
Leonid Kruglyak
LW Hillier
R Li
S Levy
Wayne N. Frankel
Publication venue: Public Library of Science
Publication date: 01/08/2008

Crossref

Directory of Open Access Journals

PubMed Central

Design of a combinatorial DNA microarray for protein-DNA interaction studies

Author: CE Lawrence
CL Warren
CT Harbison
EH Davidson
H Bolouri
JD Hughes
JK Wang
Julian Mintseris
LW Hillier
Michael B Eisen
ML Bulyk
RD Egeland
S Mukherjee
SS Skiena
TI Lee
TJ Albert
V Matys
X Liu
Publication venue: BioMed Central
Publication date: 01/01/2006

BACKGROUND: Discovery of precise specificity of transcription factors is an important step on the way to understanding the complex mechanisms of gene regulation in eukaryotes. Recently, double-stranded protein-binding microarrays were developed as a potentially scalable approach to tackle transcription factor binding site identification. RESULTS: Here we present an algorithmic approach to experimental design of a microarray that allows for testing full specificity of a transcription factor binding to all possible DNA binding sites of a given length, with optimally efficient use of the array. This design is universal, works for any factor that binds a sequence motif and is not species-specific. Furthermore, simulation results show that data produced with the designed arrays are easier to analyze and would result in more precise identification of binding sites. CONCLUSION: In this study, we present a design of a double-stranded DNA microarray for protein-DNA interaction studies and show that our algorithm allows optimally efficient use of the arrays for this purpose. We believe such a design will prove useful for transcription factor binding site identification and other biological problems.

Crossref

Boston University Institutional Repository (OpenBU)

Springer - Publisher Connector

PubMed Central

UNT Digital Library

Improving the Efficiency of Physical Examination Services

Author: A Raouf
Aaron E. Bair
AG Kalton
AM Law
BM Reilly
CJ Schwind
D Fone
D Goldsman
D Krahl
D Miller
DA Nardone
DC Montgomery
DF Salerno
DH Kropp
FS Hillier
G Wullink
JB Jun
JE Clague
JG Nomden
JJ Swain
JPC Kleijnen
L Aharonson-Daniel
LW Friedman
LW Schruben
M Babes
Mingchang Chih
NK Kwak
P Dull
R Indra
RH Edwards
SM Shechter
V Podgorelec
W England
W Vogt
W-MT Song
WB Nordgren
WD Kelton
Wheyming Tina Song
Publication venue: Springer US
Publication date: 01/01/2009

The objective of our project was to improve the efficiency of the physical examination screening service of a large hospital system. We began with a detailed simulation model to explore the relationships between four performance measures and three decision factors. We then attempted to identify the optimal physician inquiry starting time by solving a goal-programming problem, where the objective function includes multiple goals. One of our simulation results shows that the proposed optimal physician inquiry starting time decreased patient wait times by 50% without increasing overall physician utilization

Crossref

Springer - Publisher Connector

PubMed Central

eScholarship - University of California

Annotation of two large contiguous regions from the Haemonchus contortus genome using RNA-seq and comparative analysis with Caenorhabditis elegans

Author: A Coghlan
A Couthier
AJ Wolstenholme
Anna V. Protasio
C Liu
Clotilde K. S. Carlow
DB Guiliano
DL Laughton
DL Redmond
DP Knox
E Ghedin
E Redman
F Jackson
Frank Jackson
Gary Saunders
H Li
J Parkinson
J Spieth
JC Abbott
JH Graber
JL Bessereau
JM Ranz
John S. Gilleard
JR Vanfleteren
JS Gilleard
K Rutherford
Karen Mungall
L Duret
L Rufener
LD Stein
LF LeJambre
LW Hillier
M Caceres
M Deutsch
Martin Hunt
Matthew Berriman
Michael Quail
MJ Callaghan
PS Chain
R Hoekstra
R Kaminsky
R Prichard
Robin Beech
Roz Laing
S Chen
S Leroy
Steven Laing
T Blumenthal
T Carver
TJ Carver
V Grillo
W Qian
Y Tanizawa
Publication venue: Public Library of Science (PLoS)
Publication date: 15/08/2011

The genomes of numerous parasitic nematodes are currently being sequenced, but their complexity and size, together with high levels of intra-specific sequence variation and a lack of reference genomes, make their assembly and annotation a challenging task. Haemonchus contortus is an economically significant parasite of livestock that is widely used for basic research as well as for vaccine development and drug discovery. It is one of many medically and economically important parasites within the strongylid nematode group. This group of parasites has the closest phylogenetic relationship with the model organism Caenorhabditis elegans, making comparative analysis a potentially powerful tool for genome annotation and functional studies. To investigate this hypothesis, we sequenced two contiguous fragments from the H. contortus genome and undertook detailed annotation and comparative analysis with C. elegans. The adult H. contortus transcriptome was sequenced using an Illumina platform and RNA-seq was used to annotate a 409 kb overlapping BAC tiling path relating to the X chromosome and a 181 kb BAC insert relating to chromosome I. In total, 40 genes and 12 putative transposable elements were identified. 97.5% of the annotated genes had detectable homologues in C. elegans, of which 60% had putative orthologues, significantly higher than previous analyses based on EST analysis. Gene density appears to be lower in H. contortus than in C. elegans, with annotated H. contortus genes being an average of two-to-three times larger than their putative C. elegans orthologues due to a greater intron number and size. Synteny appears high but gene order is generally poorly conserved, although areas of conserved microsynteny are apparent. C. elegans operons appear to be partially conserved in H. contortus. Our findings suggest that a combination of RNA-seq and comparative analysis with C. elegans is a powerful approach for the annotation and analysis of strongylid nematode genomes.

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Enlighten

Inherent Signals in Sequencing-Based Chromatin-ImmunoPrecipitation Control Libraries

Author: AA Bhinge
CL Wei
CY Lin
D Karolchik
DE Schones
DS Johnson
Edwin Cheung
G Bourque
I. King Jordan
JC Dohm
L Conti
LW Hillier
Nallasivam Palanisamy
S Impey
TS Mikkelsen
VB Vega
Vinsensius B. Vega
Wing-Kin Sung
X Chen
Y Benjamini
Y Zhang
Publication venue: Public Library of Science
Publication date: 01/01/2009

The growth of sequencing-based Chromatin Immuno-Precipitation studies calls for a more in-depth understanding of the nature of the technology and of the resultant data to reduce false positives and false negatives. Control libraries are typically constructed to complement such studies in order to mitigate the effect of systematic biases that might be present in the data. In this study, we explored multiple control libraries to obtain a better understanding of what they truly represent. First, we analyzed the genome-wide profiles of various sequencing-based libraries at a low resolution of 1 Mbp, and compared them with each other as well as against aCGH data. We found that copy number has a major influence on both ChIP-enriched and control libraries. Following that, we inspected the repeat regions to assess the extent of mapping bias. Next, significantly tag-rich 5 kbp regions were identified and they were associated with various genomic landmarks. For instance, we discovered that gene boundaries were surprisingly enriched with sequenced tags. Further, profiles between different cell types were noticeably distinct although the cell types were somewhat related and similar. We found that control libraries bear traces of systematic biases. The biases can be attributed to genomic copy number, inherent sequencing bias, plausible mapping ambiguity, and cell-type specific chromatin structure. Our results suggest careful analysis of control libraries can reveal promising biological insights.

CiteSeerX

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

ScholarBank@NUS

The development and characterization of a 60K SNP chip for chicken

Author: Addie Vereijken
AM Ramos
C Rubin
CP Van Tassell
DA Magee
GK Wong
H Li
H-J Megens
Hans H Cheng
Hendrik-Jan Megens
KL Gunderson
LaDeana W Hillier
LK Matukumalli
LW Hillier
M Garber
MAM Groenen
Martien AM Groenen
MG Elferink
P Green
P Wahlberg
RA Dalloul
Richard PMA Crooijmans
Ron Okimoto
RPMA Crooijmans
WC Warren
Wesley C Warren
William M Muir
WM Muir
Yalda Zare
Publication venue: BioMed Central
Publication date: 01/01/2011

Abstract Background In livestock species like the chicken, high throughput single nucleotide polymorphism (SNP) genotyping assays are increasingly being used for whole genome association studies and as a tool in breeding (referred to as genomic selection). To be of value in a wide variety of breeds and populations, the success rate of the SNP genotyping assay, the distribution of the SNP across the genome and the minor allele frequencies (MAF) of the SNPs used are extremely important. Results We describe the design of a moderate density (60k) Illumina SNP BeadChip in chicken consisting of SNPs known to be segregating at high to medium minor allele frequencies (MAF) in the two major types of commercial chicken (broilers and layers). This was achieved by the identification of 352,303 SNPs with moderate to high MAF in 2 broilers and 2 layer lines using Illumina sequencing on reduced representation libraries. To further increase the utility of the chip, we also identified SNPs on sequences currently not covered by the chicken genome assembly (Gallus_gallus-2.1). This was achieved by 454 sequencing of the chicken genome at a depth of 12x and the identification of SNPs on 454-derived contigs not covered by the current chicken genome assembly. In total we added 790 SNPs that mapped to 454-derived contigs as well as 421 SNPs with a position on Chr_random of the current assembly. The SNP chip contains 57,636 SNPs of which 54,293 could be genotyped and were shown to be segregating in chicken populations. Our SNP identification procedure appeared to be highly reliable and the overall validation rate of the SNPs on the chip was 94%. We were able to map 328 SNPs derived from the 454 sequence contigs on the chicken genome. The majority of these SNPs map to chromosomes that are already represented in genome build Gallus_gallus-2.1.0. 
Twenty-eight SNPs were used to construct two new linkage groups most likely representing two micro-chromosomes not covered by the current genome assembly. Conclusions The high success rate of the SNPs on the Illumina chicken 60K Beadchip emphasizes the power of Next generation sequence (NGS) technology for the SNP identification and selection step. The identification of SNPs from sequence contigs derived from NGS sequencing resulted in improved coverage of the chicken genome and the construction of two new linkage groups most likely representing two chicken micro-chromosomes.

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Digital Commons@Becker

Wageningen University & Research Publications

Purdue E-Pubs

Exploring the Genetic Basis of Variation in Gene Predictions with a Synthetic Association Study

Author: A Krogh
A Lomsadze
AL Delcher
Andrew M. Glazer
C Burge
E Cadieu
G Parra
GJM Hersbach
I Korf
J Ule
Jason E. Stajich
KR Chi
Lior Pachter
LW Hillier
M Stanke
M Uda
MA van den Berg
Michael B. Eisen
P Bakke
Rachel B. Brem
RJ Klein
Tera C. Levin
VG Cheung
W Zhang
Publication venue: Public Library of Science
Publication date: 29/07/2010

Identifying DNA polymorphisms that affect molecular processes like transcription, splicing, or translation typically requires genotyping and experimentally characterizing tissue from large numbers of individuals, which remains expensive and time consuming. Here we introduce an alternative strategy: a “synthetic association study” in which we computationally predict molecular phenotypes on artificial genomes containing randomly sampled combinations of polymorphic alleles, and perform a classical association study to identify genotypes underlying variation in these computationally predicted annotations. We applied this method to characterize the effects on gene structure of 32,792 single-nucleotide polymorphisms between two strains of the antibiotic-producing fungus Penicillium chrysogenum. Although these SNPs represent only 0.1 percent of the nucleotides in the genome, they collectively altered 1.8 percent of predicted gene models between these strains. To determine which SNPs or combinations of SNPs were responsible for this variation, we predicted protein-coding genes in 500 intermediate genomes, each identical except for randomly chosen alleles at each SNP position. Of 30,468 gene models in the genome, 557 varied across these 500 genomes. 226 of these polymorphic gene models (40%) were perfectly correlated with individual SNPs, all of which were within or immediately proximal to the affected gene. The genetic architectures of the other 321 were more complex, with several examples of SNP epistasis that would have been difficult to predict a priori. We expect that many of the SNPs that affect computational gene structure reflect a biologically unrealistic sensitivity of the gene prediction algorithm to sequence changes, and we propose that genome annotation algorithms could be improved by minimizing their sensitivity to natural polymorphisms.
However, many of the SNPs we identified are likely to affect transcript structure in vivo, and the synthetic association study approach can be easily generalized to any computed genome annotation to uncover relationships between genotype and important molecular phenotypes

Crossref

Directory of Open Access Journals

PubMed Central

Caltech Authors

Differential radio-sensitivities of human chromosomes 1 and 2 in one donor in interphase- and metaphase-spreads after 60Co γ-irradiation

Author: A Wojcik
Adarsh Ramakumar
AT Natarajan
E Schmid
ES Lander
F Darakhshan
F Darroudi
J Meunier
JF Barquinero
JL Fernandez
JN Lucas
K George
K Lee
LW Hillier
M Grigorova
MR Branco
Pataje GS Prasanna
PG Prasanna
PJ Simpson
R Lee
RC Wilkins
RJ Davis
Rupak Pathak
S Knehr
S Luomahaara
S Sommer
T Cremer
TE Takasuka
TK Pandita
Uma Subramanian
Publication venue: BioMed Central
Publication date: 01/01/2009

Crossref

Springer - Publisher Connector

PubMed Central

Using ESTs to improve the accuracy of de novo gene prediction

Author: A Krogh
AA Salamov
AC Siepel
C Wei
Chaochun Wei
DR Maglott
E Birney
I Korf
JE Allen
KD Pruitt
KL Howe
L Stein
LW Hillier
M Stanke
MG Reese
Michael R Brent
MJ van Baren
MR Brent
MS Boguski
P Flicek
R Guigo
R Guigó
R Mott
RA Gibbs
RH Brown
RH Waterston
S Foissac
SS Gross
The MGC Project Team
TW Harris
VV Solovyev
WJ Kent
Publication venue: BioMed Central
Publication date: 01/07/2006

BACKGROUND: ESTs are a tremendous resource for determining the exon-intron structures of genes, but even extensive EST sequencing tends to leave many exons and genes untouched. Gene prediction systems based exclusively on EST alignments miss these exons and genes, leading to poor sensitivity. De novo gene prediction systems, which ignore ESTs in favor of genomic sequence, can predict such "untouched" exons, but they are less accurate when predicting exons to which ESTs align. TWINSCAN is the most accurate de novo gene finder available for nematodes and N-SCAN is the most accurate for mammals, as measured by exact CDS gene prediction and exact exon prediction. RESULTS: TWINSCAN_EST is a new system that successfully combines EST alignments with TWINSCAN. On the whole C. elegans genome TWINSCAN_EST shows 14% improvement in sensitivity and 13% in specificity in predicting exact gene structures compared to TWINSCAN without EST alignments. Not only are the structures revealed by EST alignments predicted correctly, but these also constrain the predictions without alignments, improving their accuracy. For the human genome, we used the same approach with N-SCAN, creating N-SCAN_EST. On the whole genome, N-SCAN_EST produced a 6% improvement in sensitivity and 1% in specificity of exact gene structure predictions compared to N-SCAN. CONCLUSION: TWINSCAN_EST and N-SCAN_EST are more accurate than TWINSCAN and N-SCAN, while retaining their ability to discover novel genes to which no ESTs align. Thus, we recommend using the EST versions of these programs to annotate any genome for which EST information is available. TWINSCAN_EST and N-SCAN_EST are part of the TWINSCAN open source software package

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

BowStrap v1.0: Assigning statistical significance to expressed genes using short-read transcriptome data

Author: A Mortazavi
A Roberts
B Langmead
F Martin
Frank R Collart
GA Tuskan
JC Marioni
LW Hillier
N Whiteford
PE Larsen
Peter E Larsen
U Nagalakshmi
Y Benjamini
Publication venue: Springer Science and Business Media LLC
Publication date

Crossref